Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 105908 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 263 |
| Duplicate rows (%) | 0.2% |
| Total size in memory | 11.3 MiB |
| Average record size in memory | 112.0 B |
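The memory and duplicate figures above can be cross-checked with simple arithmetic. This is a minimal sketch that assumes every value is stored in 8 bytes (a 64-bit number); the report does not state the column dtypes, so `bytes_per_value` is an assumption.

```python
# Figures copied from the dataset statistics table.
n_rows = 105908
n_columns = 14
n_duplicates = 263

# Assumption: each cell is a 64-bit (8-byte) number; the report does not say so.
bytes_per_value = 8

record_bytes = n_columns * bytes_per_value   # bytes per row
total_mib = n_rows * record_bytes / 2**20    # whole table in MiB
duplicate_pct = n_duplicates / n_rows * 100  # share of duplicate rows

print(f"record size: {record_bytes} B")
print(f"total size: {total_mib:.1f} MiB")
print(f"duplicates: {duplicate_pct:.1f}%")
```

Under that assumption the numbers agree with the reported 112.0 B per record, 11.3 MiB in total, and 0.2% duplicate rows.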
Variable types
| NUM | 14 |
|---|
Reproduction
| Analysis started | 2020-08-25 01:52:00.659090 |
|---|---|
| Analysis finished | 2020-08-25 01:52:38.381072 |
| Duration | 37.72 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Dataset has 263 (0.2%) duplicate rows | Duplicates |
|---|---|
| V0 has 82464 (77.9%) zeros | Zeros |
| V1 has 11841 (11.2%) zeros | Zeros |
| V2 has 15532 (14.7%) zeros | Zeros |
| V3 has 76247 (72.0%) zeros | Zeros |
| V5 has 49127 (46.4%) zeros | Zeros |
| V6 has 1129 (1.1%) zeros | Zeros |
| V8 has 9520 (9.0%) zeros | Zeros |
| V10 has 12144 (11.5%) zeros | Zeros |
| target has 21359 (20.2%) zeros | Zeros |
V0
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7567889111304151 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 82464 |
| Zeros (%) | 77.9% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.957220847 |
|---|---|
| Coefficient of variation (CV) | 2.58621766 |
| Kurtosis | 10.64785256 |
| Mean | 0.7567889111 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.259941101 |
| Sum | 80150 |
| Variance | 3.830713445 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 82464 | 77.9% | |
| 1 | 7970 | 7.5% | |
| 2 | 4767 | 4.5% | |
| 3 | 2783 | 2.6% | |
| 10 | 1924 | 1.8% | |
| 4 | 1781 | 1.7% | |
| 5 | 1267 | 1.2% | |
| 6 | 920 | 0.9% | |
| 7 | 779 | 0.7% | |
| 8 | 652 | 0.6% | |
| 9 | 601 | 0.6% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 82464 | 77.9% | |
| 1 | 7970 | 7.5% | |
| 2 | 4767 | 4.5% | |
| 3 | 2783 | 2.6% | |
| 4 | 1781 | 1.7% | |
| 5 | 1267 | 1.2% | |
| 6 | 920 | 0.9% | |
| 7 | 779 | 0.7% | |
| 8 | 652 | 0.6% | |
| 9 | 601 | 0.6% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 1924 | 1.8% | |
| 9 | 601 | 0.6% | |
| 8 | 652 | 0.6% | |
| 7 | 779 | 0.7% | |
| 6 | 920 | 0.9% | |
| 5 | 1267 | 1.2% | |
| 4 | 1781 | 1.7% | |
| 3 | 2783 | 2.6% | |
| 2 | 4767 | 4.5% | |
| 1 | 7970 | 7.5% |
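The descriptive statistics for V0 can be recomputed from its value counts alone. The sketch below copies the counts from the tables above and assumes the variance is the sample variance (dividing by n - 1), which is what reproduces the reported figure.

```python
from math import sqrt

# Value counts for V0, copied from the frequency table above.
v0_counts = {0: 82464, 1: 7970, 2: 4767, 3: 2783, 4: 1781, 5: 1267,
             6: 920, 7: 779, 8: 652, 9: 601, 10: 1924}

n = sum(v0_counts.values())                       # number of observations
total = sum(v * c for v, c in v0_counts.items())  # the "Sum" row
mean = total / n

# Assumption: sample variance (divide by n - 1), not population variance.
variance = sum(c * (v - mean) ** 2 for v, c in v0_counts.items()) / (n - 1)
std = sqrt(variance)
cv = std / mean                                   # coefficient of variation
zeros_pct = v0_counts[0] / n * 100

print(n, total, mean, variance, std, cv, zeros_pct)
```

The coefficient of variation is the standard deviation divided by the mean, so a value above 2 here reflects how strongly the zeros dominate the column.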
V1
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.224515617328247 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 11841 |
| Zeros (%) | 11.2% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.910999864 |
|---|---|
| Coefficient of variation (CV) | 0.6890730507 |
| Kurtosis | -0.915051671 |
| Mean | 4.224515617 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.2843771158 |
| Sum | 447410 |
| Variance | 8.47392021 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2 | 12095 | 11.4% | |
| 0 | 11841 | 11.2% | |
| 3 | 11731 | 11.1% | |
| 4 | 11577 | 10.9% | |
| 1 | 11351 | 10.7% | |
| 5 | 11255 | 10.6% | |
| 6 | 10427 | 9.8% | |
| 7 | 8992 | 8.5% | |
| 8 | 6999 | 6.6% | |
| 10 | 5835 | 5.5% | |
| 9 | 3805 | 3.6% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 11841 | 11.2% | |
| 1 | 11351 | 10.7% | |
| 2 | 12095 | 11.4% | |
| 3 | 11731 | 11.1% | |
| 4 | 11577 | 10.9% | |
| 5 | 11255 | 10.6% | |
| 6 | 10427 | 9.8% | |
| 7 | 8992 | 8.5% | |
| 8 | 6999 | 6.6% | |
| 9 | 3805 | 3.6% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 5835 | 5.5% | |
| 9 | 3805 | 3.6% | |
| 8 | 6999 | 6.6% | |
| 7 | 8992 | 8.5% | |
| 6 | 10427 | 9.8% | |
| 5 | 11255 | 10.6% | |
| 4 | 11577 | 10.9% | |
| 3 | 11731 | 11.1% | |
| 2 | 12095 | 11.4% | |
| 1 | 11351 | 10.7% |
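The quantile statistics for V1 can likewise be reproduced from its value counts. This is a sketch that assumes linear interpolation between neighbouring ranks (the pandas default); the helper names `value_at_rank` and `quantile` are illustrative and not part of any library.

```python
from math import floor

# Value counts for V1, copied from the frequency table above.
v1_counts = {0: 11841, 1: 11351, 2: 12095, 3: 11731, 4: 11577, 5: 11255,
             6: 10427, 7: 8992, 8: 6999, 9: 3805, 10: 5835}

def value_at_rank(counts, rank):
    """Return the value found at a 0-based rank of the sorted data."""
    seen = 0
    for value in sorted(counts):
        seen += counts[value]
        if rank < seen:
            return value
    raise IndexError("rank out of range")

def quantile(counts, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    n = sum(counts.values())
    pos = q * (n - 1)          # fractional 0-based position
    lo = floor(pos)
    hi = min(lo + 1, n - 1)
    frac = pos - lo
    low, high = value_at_rank(counts, lo), value_at_rank(counts, hi)
    return low + frac * (high - low)

q1, median, q3 = (quantile(v1_counts, q) for q in (0.25, 0.5, 0.75))
iqr = q3 - q1
print(q1, median, q3, iqr)
```

Working from counts avoids materialising all 105908 values and makes it easy to see which bucket each quantile falls into.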
V2
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.382756732258186 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 15532 |
| Zeros (%) | 14.7% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.090336975 |
|---|---|
| Coefficient of variation (CV) | 0.7051125954 |
| Kurtosis | -1.295707287 |
| Mean | 4.382756732 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.07479127684 |
| Sum | 464169 |
| Variance | 9.550182616 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 15532 | 14.7% | |
| 8 | 11326 | 10.7% | |
| 9 | 10340 | 9.8% | |
| 3 | 10337 | 9.8% | |
| 1 | 10237 | 9.7% | |
| 2 | 9798 | 9.3% | |
| 6 | 9378 | 8.9% | |
| 4 | 9168 | 8.7% | |
| 7 | 9091 | 8.6% | |
| 5 | 8786 | 8.3% | |
| 10 | 1915 | 1.8% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 15532 | 14.7% | |
| 1 | 10237 | 9.7% | |
| 2 | 9798 | 9.3% | |
| 3 | 10337 | 9.8% | |
| 4 | 9168 | 8.7% | |
| 5 | 8786 | 8.3% | |
| 6 | 9378 | 8.9% | |
| 7 | 9091 | 8.6% | |
| 8 | 11326 | 10.7% | |
| 9 | 10340 | 9.8% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 1915 | 1.8% | |
| 9 | 10340 | 9.8% | |
| 8 | 11326 | 10.7% | |
| 7 | 9091 | 8.6% | |
| 6 | 9378 | 8.9% | |
| 5 | 8786 | 8.3% | |
| 4 | 9168 | 8.7% | |
| 3 | 10337 | 9.8% | |
| 2 | 9798 | 9.3% | |
| 1 | 10237 | 9.7% |
V3
Real number (ℝ≥0)
| Distinct count | 9 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8173603504928806 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 76247 |
| Zeros (%) | 72.0% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.398135035 |
|---|---|
| Coefficient of variation (CV) | 1.869819067 |
| Kurtosis | 1.166397569 |
| Mean | 1.81736035 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.664817053 |
| Sum | 192473 |
| Variance | 11.54732172 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 76247 | 72.0% | |
| 10 | 11900 | 11.2% | |
| 2 | 7448 | 7.0% | |
| 5 | 5205 | 4.9% | |
| 7 | 2218 | 2.1% | |
| 4 | 1510 | 1.4% | |
| 9 | 804 | 0.8% | |
| 6 | 429 | 0.4% | |
| 8 | 147 | 0.1% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 76247 | 72.0% | |
| 2 | 7448 | 7.0% | |
| 4 | 1510 | 1.4% | |
| 5 | 5205 | 4.9% | |
| 6 | 429 | 0.4% | |
| 7 | 2218 | 2.1% | |
| 8 | 147 | 0.1% | |
| 9 | 804 | 0.8% | |
| 10 | 11900 | 11.2% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 11900 | 11.2% | |
| 9 | 804 | 0.8% | |
| 8 | 147 | 0.1% | |
| 7 | 2218 | 2.1% | |
| 6 | 429 | 0.4% | |
| 5 | 5205 | 4.9% | |
| 4 | 1510 | 1.4% | |
| 2 | 7448 | 7.0% | |
| 0 | 76247 | 72.0% |
V4
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.804726743966461 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 376 |
| Zeros (%) | 0.4% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 5 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.981436682 |
|---|---|
| Coefficient of variation (CV) | 0.2911853416 |
| Kurtosis | -0.3641316493 |
| Mean | 6.804726744 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.445608642 |
| Sum | 720675 |
| Variance | 3.926091323 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 9 | 20536 | 19.4% | |
| 8 | 17637 | 16.7% | |
| 7 | 17615 | 16.6% | |
| 6 | 16484 | 15.6% | |
| 5 | 13207 | 12.5% | |
| 4 | 9101 | 8.6% | |
| 10 | 5533 | 5.2% | |
| 3 | 4235 | 4.0% | |
| 2 | 888 | 0.8% | |
| 0 | 376 | 0.4% | |
| 1 | 296 | 0.3% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 376 | 0.4% | |
| 1 | 296 | 0.3% | |
| 2 | 888 | 0.8% | |
| 3 | 4235 | 4.0% | |
| 4 | 9101 | 8.6% | |
| 5 | 13207 | 12.5% | |
| 6 | 16484 | 15.6% | |
| 7 | 17615 | 16.6% | |
| 8 | 17637 | 16.7% | |
| 9 | 20536 | 19.4% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 5533 | 5.2% | |
| 9 | 20536 | 19.4% | |
| 8 | 17637 | 16.7% | |
| 7 | 17615 | 16.6% | |
| 6 | 16484 | 15.6% | |
| 5 | 13207 | 12.5% | |
| 4 | 9101 | 8.6% | |
| 3 | 4235 | 4.0% | |
| 2 | 888 | 0.8% | |
| 1 | 296 | 0.3% |
V5
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7384616837254976 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 49127 |
| Zeros (%) | 46.4% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.410721193 |
|---|---|
| Coefficient of variation (CV) | 1.386697915 |
| Kurtosis | 1.911485609 |
| Mean | 1.738461684 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.605411866 |
| Sum | 184117 |
| Variance | 5.811576668 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 49127 | 46.4% | |
| 1 | 17760 | 16.8% | |
| 2 | 11593 | 10.9% | |
| 3 | 7747 | 7.3% | |
| 4 | 5359 | 5.1% | |
| 5 | 3898 | 3.7% | |
| 6 | 3020 | 2.9% | |
| 7 | 2543 | 2.4% | |
| 8 | 2052 | 1.9% | |
| 9 | 1423 | 1.3% | |
| 10 | 1386 | 1.3% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 49127 | 46.4% | |
| 1 | 17760 | 16.8% | |
| 2 | 11593 | 10.9% | |
| 3 | 7747 | 7.3% | |
| 4 | 5359 | 5.1% | |
| 5 | 3898 | 3.7% | |
| 6 | 3020 | 2.9% | |
| 7 | 2543 | 2.4% | |
| 8 | 2052 | 1.9% | |
| 9 | 1423 | 1.3% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 1386 | 1.3% | |
| 9 | 1423 | 1.3% | |
| 8 | 2052 | 1.9% | |
| 7 | 2543 | 2.4% | |
| 6 | 3020 | 2.9% | |
| 5 | 3898 | 3.7% | |
| 4 | 5359 | 5.1% | |
| 3 | 7747 | 7.3% | |
| 2 | 11593 | 10.9% | |
| 1 | 17760 | 16.8% |
V6
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.091796653699437 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 1129 |
| Zeros (%) | 1.1% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.026935694 |
|---|---|
| Coefficient of variation (CV) | 0.3327320016 |
| Kurtosis | 0.04899455946 |
| Mean | 6.091796654 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.470656649 |
| Sum | 645170 |
| Variance | 4.108468306 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 6 | 19777 | 18.7% | |
| 7 | 19523 | 18.4% | |
| 8 | 16906 | 16.0% | |
| 5 | 16865 | 15.9% | |
| 4 | 10664 | 10.1% | |
| 9 | 8704 | 8.2% | |
| 3 | 5585 | 5.3% | |
| 2 | 2876 | 2.7% | |
| 10 | 2544 | 2.4% | |
| 1 | 1335 | 1.3% | |
| 0 | 1129 | 1.1% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1129 | 1.1% | |
| 1 | 1335 | 1.3% | |
| 2 | 2876 | 2.7% | |
| 3 | 5585 | 5.3% | |
| 4 | 10664 | 10.1% | |
| 5 | 16865 | 15.9% | |
| 6 | 19777 | 18.7% | |
| 7 | 19523 | 18.4% | |
| 8 | 16906 | 16.0% | |
| 9 | 8704 | 8.2% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 2544 | 2.4% | |
| 9 | 8704 | 8.2% | |
| 8 | 16906 | 16.0% | |
| 7 | 19523 | 18.4% | |
| 6 | 19777 | 18.7% | |
| 5 | 16865 | 15.9% | |
| 4 | 10664 | 10.1% | |
| 3 | 5585 | 5.3% | |
| 2 | 2876 | 2.7% | |
| 1 | 1335 | 1.3% |
V7
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.475724213468293 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 53 |
| Zeros (%) | 0.1% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.242115025 |
|---|---|
| Coefficient of variation (CV) | 0.2268403186 |
| Kurtosis | 0.1197024024 |
| Mean | 5.475724213 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.2606739656 |
| Sum | 579923 |
| Variance | 1.542849736 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 6 | 33351 | 31.5% | |
| 5 | 29448 | 27.8% | |
| 7 | 17313 | 16.3% | |
| 4 | 15650 | 14.8% | |
| 3 | 5179 | 4.9% | |
| 8 | 3593 | 3.4% | |
| 2 | 931 | 0.9% | |
| 9 | 267 | 0.3% | |
| 1 | 110 | 0.1% | |
| 0 | 53 | 0.1% | |
| 10 | 13 | < 0.1% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 53 | 0.1% | |
| 1 | 110 | 0.1% | |
| 2 | 931 | 0.9% | |
| 3 | 5179 | 4.9% | |
| 4 | 15650 | 14.8% | |
| 5 | 29448 | 27.8% | |
| 6 | 33351 | 31.5% | |
| 7 | 17313 | 16.3% | |
| 8 | 3593 | 3.4% | |
| 9 | 267 | 0.3% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 13 | < 0.1% | |
| 9 | 267 | 0.3% | |
| 8 | 3593 | 3.4% | |
| 7 | 17313 | 16.3% | |
| 6 | 33351 | 31.5% | |
| 5 | 29448 | 27.8% | |
| 4 | 15650 | 14.8% | |
| 3 | 5179 | 4.9% | |
| 2 | 931 | 0.9% | |
| 1 | 110 | 0.1% |
V8
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6082826604222533 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 9520 |
| Zeros (%) | 9.0% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.765890718 |
|---|---|
| Coefficient of variation (CV) | 0.6770319584 |
| Kurtosis | 0.3191945302 |
| Mean | 2.60828266 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7324272438 |
| Sum | 276238 |
| Variance | 3.118370027 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2 | 26158 | 24.7% | |
| 1 | 21640 | 20.4% | |
| 3 | 19952 | 18.8% | |
| 4 | 12578 | 11.9% | |
| 0 | 9520 | 9.0% | |
| 5 | 8605 | 8.1% | |
| 6 | 4434 | 4.2% | |
| 7 | 2018 | 1.9% | |
| 8 | 747 | 0.7% | |
| 9 | 177 | 0.2% | |
| 10 | 79 | 0.1% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 9520 | 9.0% | |
| 1 | 21640 | 20.4% | |
| 2 | 26158 | 24.7% | |
| 3 | 19952 | 18.8% | |
| 4 | 12578 | 11.9% | |
| 5 | 8605 | 8.1% | |
| 6 | 4434 | 4.2% | |
| 7 | 2018 | 1.9% | |
| 8 | 747 | 0.7% | |
| 9 | 177 | 0.2% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 79 | 0.1% | |
| 9 | 177 | 0.2% | |
| 8 | 747 | 0.7% | |
| 7 | 2018 | 1.9% | |
| 6 | 4434 | 4.2% | |
| 5 | 8605 | 8.1% | |
| 4 | 12578 | 11.9% | |
| 3 | 19952 | 18.8% | |
| 2 | 26158 | 24.7% | |
| 1 | 21640 | 20.4% |
V9
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.024577935566718 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 830 |
| Zeros (%) | 0.8% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.881087861 |
|---|---|
| Coefficient of variation (CV) | 0.374377288 |
| Kurtosis | 0.02855207302 |
| Mean | 5.024577936 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1166187292 |
| Sum | 532143 |
| Variance | 3.538491541 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5 | 24716 | 23.3% | |
| 6 | 19004 | 17.9% | |
| 4 | 18979 | 17.9% | |
| 3 | 12889 | 12.2% | |
| 7 | 11198 | 10.6% | |
| 8 | 6320 | 6.0% | |
| 2 | 6099 | 5.8% | |
| 9 | 2368 | 2.2% | |
| 1 | 1950 | 1.8% | |
| 10 | 1555 | 1.5% | |
| 0 | 830 | 0.8% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 830 | 0.8% | |
| 1 | 1950 | 1.8% | |
| 2 | 6099 | 5.8% | |
| 3 | 12889 | 12.2% | |
| 4 | 18979 | 17.9% | |
| 5 | 24716 | 23.3% | |
| 6 | 19004 | 17.9% | |
| 7 | 11198 | 10.6% | |
| 8 | 6320 | 6.0% | |
| 9 | 2368 | 2.2% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 1555 | 1.5% | |
| 9 | 2368 | 2.2% | |
| 8 | 6320 | 6.0% | |
| 7 | 11198 | 10.6% | |
| 6 | 19004 | 17.9% | |
| 5 | 24716 | 23.3% | |
| 4 | 18979 | 17.9% | |
| 3 | 12889 | 12.2% | |
| 2 | 6099 | 5.8% | |
| 1 | 1950 | 1.8% |
V10
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.948313630698342 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 12144 |
| Zeros (%) | 11.5% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.061111088 |
|---|---|
| Coefficient of variation (CV) | 0.6186170313 |
| Kurtosis | -1.07763451 |
| Mean | 4.948313631 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.07926101763 |
| Sum | 524066 |
| Variance | 9.370401095 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 6 | 13030 | 12.3% | |
| 0 | 12144 | 11.5% | |
| 7 | 12114 | 11.4% | |
| 5 | 10772 | 10.2% | |
| 2 | 9864 | 9.3% | |
| 8 | 9458 | 8.9% | |
| 4 | 8854 | 8.4% | |
| 3 | 8736 | 8.2% | |
| 10 | 8324 | 7.9% | |
| 9 | 6795 | 6.4% | |
| 1 | 5817 | 5.5% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 12144 | 11.5% | |
| 1 | 5817 | 5.5% | |
| 2 | 9864 | 9.3% | |
| 3 | 8736 | 8.2% | |
| 4 | 8854 | 8.4% | |
| 5 | 10772 | 10.2% | |
| 6 | 13030 | 12.3% | |
| 7 | 12114 | 11.4% | |
| 8 | 9458 | 8.9% | |
| 9 | 6795 | 6.4% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 8324 | 7.9% | |
| 9 | 6795 | 6.4% | |
| 8 | 9458 | 8.9% | |
| 7 | 12114 | 11.4% | |
| 6 | 13030 | 12.3% | |
| 5 | 10772 | 10.2% | |
| 4 | 8854 | 8.4% | |
| 3 | 8736 | 8.2% | |
| 2 | 9864 | 9.3% | |
| 1 | 5817 | 5.5% |
V11
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.874806435774445 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 1027 |
| Zeros (%) | 1.0% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 6 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.595990743 |
|---|---|
| Coefficient of variation (CV) | 0.4418853235 |
| Kurtosis | -0.8496571744 |
| Mean | 5.874806436 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.02565655058 |
| Sum | 622189 |
| Variance | 6.739167935 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5 | 15987 | 15.1% | |
| 6 | 15126 | 14.3% | |
| 10 | 14889 | 14.1% | |
| 4 | 12392 | 11.7% | |
| 7 | 11919 | 11.3% | |
| 3 | 10307 | 9.7% | |
| 8 | 8526 | 8.1% | |
| 2 | 6863 | 6.5% | |
| 9 | 5985 | 5.7% | |
| 1 | 2887 | 2.7% | |
| 0 | 1027 | 1.0% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 1027 | 1.0% | |
| 1 | 2887 | 2.7% | |
| 2 | 6863 | 6.5% | |
| 3 | 10307 | 9.7% | |
| 4 | 12392 | 11.7% | |
| 5 | 15987 | 15.1% | |
| 6 | 15126 | 14.3% | |
| 7 | 11919 | 11.3% | |
| 8 | 8526 | 8.1% | |
| 9 | 5985 | 5.7% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 14889 | 14.1% | |
| 9 | 5985 | 5.7% | |
| 8 | 8526 | 8.1% | |
| 7 | 11919 | 11.3% | |
| 6 | 15126 | 14.3% | |
| 5 | 15987 | 15.1% | |
| 4 | 12392 | 11.7% | |
| 3 | 10307 | 9.7% | |
| 2 | 6863 | 6.5% | |
| 1 | 2887 | 2.7% |
V12
Real number (ℝ≥0)
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.050241719228009 |
|---|---|
| Minimum | 0.0 |
| Maximum | 10.0 |
| Zeros | 508 |
| Zeros (%) | 0.5% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.596247452 |
|---|---|
| Coefficient of variation (CV) | 0.3941116513 |
| Kurtosis | -0.3796163217 |
| Mean | 4.050241719 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1248314465 |
| Sum | 428953 |
| Variance | 2.548005928 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 4 | 30290 | 28.6% | |
| 3 | 19453 | 18.4% | |
| 5 | 16378 | 15.5% | |
| 6 | 13789 | 13.0% | |
| 2 | 13543 | 12.8% | |
| 7 | 6190 | 5.8% | |
| 1 | 4535 | 4.3% | |
| 8 | 1143 | 1.1% | |
| 0 | 508 | 0.5% | |
| 9 | 75 | 0.1% | |
| 10 | 4 | < 0.1% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 508 | 0.5% | |
| 1 | 4535 | 4.3% | |
| 2 | 13543 | 12.8% | |
| 3 | 19453 | 18.4% | |
| 4 | 30290 | 28.6% | |
| 5 | 16378 | 15.5% | |
| 6 | 13789 | 13.0% | |
| 7 | 6190 | 5.8% | |
| 8 | 1143 | 1.1% | |
| 9 | 75 | 0.1% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 10 | 4 | < 0.1% | |
| 9 | 75 | 0.1% | |
| 8 | 1143 | 1.1% | |
| 7 | 6190 | 5.8% | |
| 6 | 13789 | 13.0% | |
| 5 | 16378 | 15.5% | |
| 4 | 30290 | 28.6% | |
| 3 | 19453 | 18.4% | |
| 2 | 13543 | 12.8% | |
| 1 | 4535 | 4.3% |
target
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.952439853457718 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 21359 |
| Zeros (%) | 20.2% |
| Memory size | 827.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.417446185 |
|---|---|
| Coefficient of variation (CV) | 0.7259871193 |
| Kurtosis | 0.1886292561 |
| Mean | 1.952439853 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.6113606184 |
| Sum | 206779 |
| Variance | 2.009153687 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2 | 52698 | 49.8% | |
| 0 | 21359 | 20.2% | |
| 5 | 11967 | 11.3% | |
| 3 | 10832 | 10.2% | |
| 1 | 9052 | 8.5% |
Minimum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 21359 | 20.2% | |
| 1 | 9052 | 8.5% | |
| 2 | 52698 | 49.8% | |
| 3 | 10832 | 10.2% | |
| 5 | 11967 | 11.3% |
Maximum values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5 | 11967 | 11.3% | |
| 3 | 10832 | 10.2% | |
| 2 | 52698 | 49.8% | |
| 1 | 9052 | 8.5% | |
| 0 | 21359 | 20.2% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a perfectly linear relationship the steepness of the line does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
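The definition above translates directly into code. This is a minimal sketch, not the profiling tool's implementation; the function name `pearson_r` is illustrative, and the toy input is the V1 and V2 values from the first ten sample rows of this report, so the result says nothing about the full dataset.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/(n - 1) factors of covariance and variances cancel, so sums suffice.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# V1 and V2 from the first ten sample rows (toy input only).
v1 = [2, 5, 5, 0, 6, 5, 6, 9, 5, 0]
v2 = [2, 9, 7, 0, 8, 4, 7, 10, 7, 0]

r = pearson_r(v1, v2)
# Shifting and rescaling either variable (with a positive factor) leaves r unchanged.
r_rescaled = pearson_r([2 * x + 3 for x in v1], [5 * y - 1 for y in v2])
print(r, r_rescaled)
```

The rescaled call illustrates the invariance under changes in location and scale described above.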
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
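As a sketch of that recipe, the code below ranks each variable and then applies the Pearson formula to the ranks. Giving tied values their average rank is an assumption here (it is a common convention, but the text above does not specify tie handling), and the helper names are illustrative.

```python
from math import sqrt

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))

def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# A monotonic but nonlinear relationship: y = x cubed.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

On this input the rank correlation is perfect while the linear correlation is not, which is the distinction the paragraph above draws.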
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
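The pair-counting formula as stated corresponds to the variant usually called tau-a. The sketch below implements exactly that with a quadratic loop over pairs; note that libraries often report a tie-adjusted variant instead (for example, SciPy's `kendalltau` defaults to tau-b), so values on tied data may differ. The function name `kendall_tau_a` is illustrative.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs; tied pairs count as neither."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1    # both variables move in the same direction
        elif product < 0:
            discordant += 1    # the variables move in opposite directions
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Three observations: two concordant pairs and one discordant pair.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

The quadratic loop is fine for small samples; for a column of 105908 rows a sort-based algorithm would be the practical choice.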
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | V0 | V1 | V2 | V3 | V4 | V5 | V6 | V7 | V8 | V9 | V10 | V11 | V12 | target |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.0 | 2.0 | 2.0 | 0.0 | 8.0 | 3.0 | 7.0 | 5.0 | 1.0 | 3.0 | 7.0 | 5.0 | 5.0 | 1 |
| 1 | 0.0 | 5.0 | 9.0 | 0.0 | 6.0 | 0.0 | 7.0 | 6.0 | 1.0 | 7.0 | 2.0 | 9.0 | 3.0 | 0 |
| 2 | 0.0 | 5.0 | 7.0 | 0.0 | 6.0 | 1.0 | 6.0 | 7.0 | 5.0 | 5.0 | 5.0 | 6.0 | 5.0 | 2 |
| 3 | 0.0 | 0.0 | 0.0 | 10.0 | 7.0 | 0.0 | 4.0 | 7.0 | 3.0 | 6.0 | 4.0 | 6.0 | 4.0 | 5 |
| 4 | 5.0 | 6.0 | 8.0 | 0.0 | 8.0 | 6.0 | 4.0 | 5.0 | 8.0 | 3.0 | 4.0 | 7.0 | 4.0 | 0 |
| 5 | 0.0 | 5.0 | 4.0 | 0.0 | 9.0 | 4.0 | 4.0 | 5.0 | 2.0 | 7.0 | 4.0 | 7.0 | 6.0 | 1 |
| 6 | 0.0 | 6.0 | 7.0 | 0.0 | 3.0 | 0.0 | 7.0 | 7.0 | 2.0 | 6.0 | 6.0 | 6.0 | 3.0 | 2 |
| 7 | 0.0 | 9.0 | 10.0 | 0.0 | 8.0 | 0.0 | 6.0 | 5.0 | 1.0 | 8.0 | 1.0 | 8.0 | 3.0 | 1 |
| 8 | 0.0 | 5.0 | 7.0 | 0.0 | 6.0 | 1.0 | 6.0 | 6.0 | 2.0 | 7.0 | 2.0 | 8.0 | 4.0 | 2 |
| 9 | 0.0 | 0.0 | 0.0 | 0.0 | 6.0 | 1.0 | 6.0 | 6.0 | 4.0 | 5.0 | 4.0 | 5.0 | 5.0 | 2 |
Last rows
| | V0 | V1 | V2 | V3 | V4 | V5 | V6 | V7 | V8 | V9 | V10 | V11 | V12 | target |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 105898 | 6.0 | 9.0 | 9.0 | 2.0 | 7.0 | 2.0 | 5.0 | 6.0 | 2.0 | 7.0 | 2.0 | 9.0 | 4.0 | 0 |
| 105899 | 0.0 | 4.0 | 6.0 | 0.0 | 9.0 | 6.0 | 8.0 | 6.0 | 5.0 | 2.0 | 7.0 | 3.0 | 6.0 | 3 |
| 105900 | 0.0 | 1.0 | 2.0 | 0.0 | 4.0 | 1.0 | 7.0 | 5.0 | 2.0 | 4.0 | 6.0 | 4.0 | 5.0 | 2 |
| 105901 | 0.0 | 10.0 | 8.0 | 0.0 | 7.0 | 0.0 | 4.0 | 5.0 | 6.0 | 7.0 | 0.0 | 10.0 | 3.0 | 0 |
| 105902 | 0.0 | 2.0 | 4.0 | 10.0 | 10.0 | 9.0 | 3.0 | 3.0 | 0.0 | 3.0 | 9.0 | 2.0 | 6.0 | 3 |
| 105903 | 0.0 | 4.0 | 6.0 | 0.0 | 6.0 | 1.0 | 8.0 | 4.0 | 1.0 | 5.0 | 5.0 | 5.0 | 5.0 | 2 |
| 105904 | 0.0 | 3.0 | 4.0 | 0.0 | 7.0 | 2.0 | 8.0 | 6.0 | 2.0 | 4.0 | 10.0 | 4.0 | 4.0 | 2 |
| 105905 | 0.0 | 7.0 | 1.0 | 0.0 | 4.0 | 0.0 | 8.0 | 6.0 | 2.0 | 5.0 | 4.0 | 7.0 | 4.0 | 2 |
| 105906 | 3.0 | 4.0 | 6.0 | 0.0 | 6.0 | 0.0 | 9.0 | 4.0 | 0.0 | 4.0 | 5.0 | 5.0 | 2.0 | 1 |
| 105907 | 0.0 | 1.0 | 0.0 | 0.0 | 9.0 | 6.0 | 7.0 | 4.0 | 1.0 | 4.0 | 9.0 | 2.0 | 7.0 | 3 |
Most frequent
| | V0 | V1 | V2 | V3 | V4 | V5 | V6 | V7 | V8 | V9 | V10 | V11 | V12 | target | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | 10.0 | 0.0 | 0.0 | 10.0 | 10.0 | 0.0 | 0.0 | 3.0 | 2.0 | 0.0 | 0.0 | 10.0 | 4.0 | 0 | 4 |
| 60 | 0.0 | 2.0 | 0.0 | 0.0 | 10.0 | 0.0 | 10.0 | 4.0 | 0.0 | 3.0 | 10.0 | 2.0 | 2.0 | 2 | 3 |
| 80 | 0.0 | 2.0 | 8.0 | 0.0 | 9.0 | 0.0 | 5.0 | 4.0 | 1.0 | 8.0 | 0.0 | 10.0 | 4.0 | 0 | 3 |
| 122 | 0.0 | 4.0 | 7.0 | 0.0 | 4.0 | 0.0 | 5.0 | 6.0 | 4.0 | 7.0 | 2.0 | 10.0 | 3.0 | 2 | 3 |
| 146 | 0.0 | 6.0 | 0.0 | 10.0 | 10.0 | 0.0 | 1.0 | 4.0 | 4.0 | 4.0 | 0.0 | 10.0 | 4.0 | 5 | 3 |
| 161 | 0.0 | 6.0 | 9.0 | 0.0 | 7.0 | 0.0 | 4.0 | 4.0 | 4.0 | 8.0 | 0.0 | 10.0 | 4.0 | 0 | 3 |
| 203 | 0.0 | 10.0 | 8.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0 | 3 |
| 204 | 0.0 | 10.0 | 8.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0 | 3 |
| 213 | 0.0 | 10.0 | 9.0 | 0.0 | 9.0 | 0.0 | 5.0 | 5.0 | 1.0 | 8.0 | 0.0 | 10.0 | 4.0 | 0 | 3 |
| 215 | 0.0 | 10.0 | 10.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 10.0 | 2.0 | 0.0 | 1.0 | 0 | 3 |